A Novel Four-Way Approach Designed With Ensemble Feature Selection for Code Smell Detection

نویسندگان

چکیده

Purpose: Code smells are residuals of technical debt induced by the developers. They hinder evolution, adaptability and maintenance software. Meanwhile, they very beneficial in indicating loopholes problems bugs Machine learning has been extensively used to predict Smells research. The current study aims optimise prediction using Ensemble Learning Feature Selection techniques on three open-source Java data sets. Design Results: work Compares four varied approaches detect code performance measures Accuracy(P1), G-mean1 (P2), G-mean2 (P3), F-measure (P4). found out that values did not degrade it instead either remained same or increased with feature selection Learning. Random Forest turns be best classifier while Correlation-based selection(BFS) is amongst techniques. aggregators, i.e. ET5C2 (BFS intersection Relief Forest), ET6C2 union ET5C1 Bagging) Majority Voting give results from all aggregation combinations studied. Conclusion: Though good, but needs a lot validation for variety sets before can standardised. also pose challenge concerning diversity reliability hence exhaustive studies.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Ensemble Classification and Extended Feature Selection for Credit Card Fraud Detection

Due to the rise of technology, the possibility of fraud in different areas such as banking has been increased. Credit card fraud is a crucial problem in banking and its danger is over increasing. This paper proposes an advanced data mining method, considering both feature selection and decision cost for accuracy enhancement of credit card fraud detection. After selecting the best and most effec...

متن کامل

MLIFT: Enhancing Multi-label Classifier with Ensemble Feature Selection

Multi-label classification has gained significant attention during recent years, due to the increasing number of modern applications associated with multi-label data. Despite its short life, different approaches have been presented to solve the task of multi-label classification. LIFT is a multi-label classifier which utilizes a new strategy to multi-label learning by leveraging label-specific ...

متن کامل

A Novel Ensemble Approach for Anomaly Detection in Wireless Sensor Networks Using Time-overlapped Sliding Windows

One of the most important issues concerning the sensor data in the Wireless Sensor Networks (WSNs) is the unexpected data which are acquired from the sensors. Today, there are numerous approaches for detecting anomalies in the WSNs, most of which are based on machine learning methods. In this research, we present a heuristic method based on the concept of “ensemble of classifiers” of data minin...

متن کامل

A Novel Approach to Feature Selection Using PageRank algorithm for Web Page Classification

In this paper, a novel filter-based approach is proposed using the PageRank algorithm to select the optimal subset of features as well as to compute their weights for web page classification. To evaluate the proposed approach multiple experiments are performed using accuracy score as the main criterion on four different datasets, namely WebKB, Reuters-R8, Reuters-R52, and 20NewsGroups. By analy...

متن کامل

Genetic Approach to Feature Selection for Ensemble

Ensembles of classiiers have been shown to be very eeective for case-based classiication tasks. The vast majority of ensemble construction algorithms use the complete set of features available in the problem domain for the ensemble creation. Recent work on randomly selected subspaces for ensemble construction has been shown to improve the accuracy of the ensemble considerably. In this paper we ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: IEEE Access

سال: 2021

ISSN: ['2169-3536']

DOI: https://doi.org/10.1109/access.2021.3049823